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Abstract. Collaboration networks provide a method for examining the 
structme of eollaborative communities. The model I present in this pa- 
per connects an individual's skill set to her position in the collaboration 
network, and changes in the distribution of skills to the structure of the 
collaboration network as a whole. I show that individuals with a useful 
combination of skills will have a disproportionate number of links in the 
network, resulting in a skewed degree distribution. The degree distribu- 
tion becomes more skewed as problems become more difficult, leading to a 
community dominated by a few high-degree superstars. 

Heterogeneity is important in a wide variety of creative and knowledge-based 
fields, including product development, art and design, entrepreneurship, and 
academic research. Innovation in these areas is driven by diverse groups of 
people, working together on difficult problems. In that context, simple models 
with only a one-dimensional ability level cannot be used to study the institu- 
tional arrangements that are driven by and respond to the multidimensional 
factors that make people different. This kind of complex heterogeneity has 
garnered increasing interest in recent years, in part because it is not well un- 
derstood. In this paper, I present a framework for understanding heterogeneity 
in a context where it is not peripheral, but central: collaborative production. 
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Systems. 
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This allows me to examine the link between rich individual heterogeneity, and 
the complex structure of collaborative communities. 

Heterogeneity is particularly central to collaborative production, because 
while some common ground is required to facilitate communication, comple- 
mentarities provide the real advantage. Collaboration works when individuals 
with diverse skills come together to solve problems that none of them could 
solve alonej^ Collaboration networks provide a way of looking at the structure 
of collaborative communities. The nodes in the network represent individuals 
(eg: academics, researchers, policy-makers, or entrepreneurs) and two individ- 
uals in the network are linked if they have worked together on a problem. In 
this paper, I present the first model of the formation of collaboration networks 
that incorporates individual heterogeneity. I start with a population of prob- 
lem solvers, each of whom is endowed with a set of skills. The individuals 
collaborate on a set of problems, and the result is a collaboration network. 
By drawing direct connection between the distribution of skills in the popula- 
tion and the structure of the collaboration network, I am able to look at how 
the nature of a collaborative community (as measured by the distribution of 
skills in that community) affects the structure of the collaboration network as 
a whole. 

The structure of this network is important for a number of reasons. On 
an individual level, a person's position on the network reflects her role in 
the collaborative community-a person who is located on the periphery of a 
community will play a different role than a person who is more central. A 
person's position in the network is also correlated with her influence in the 



Hong and Page| ( |200l[ ), [Polzer et aL] ( |2002[ ), [Thomas-Hunt et al.| ( |2003[ ), |Phillips et aL 



(20041 



COLLABORATION NETWORK FORMATION WITH HETEROGENEOUS SKILLS 3 

collaborative communityj^ her access to knowledge, tools, and informationj^ 
and her probability of finding a jobj^ On a more global level, the overall 
structure of the collaboration network reflects the nature of the problem solving 
community-a community consisting of several densely-connected clusters will 
have a different character than a community where links are spread evenly. 
Moreover, network structure impacts the function of the community, improving 
or impeding the flow of information and ideas, opinion formation, and the 
spread of effective technologies]^ 

This suggests two interesting classes of questions relating to collaboration 
networks: 

(1) What characteristics of an individual affect her position on the collab- 
oration network? 

(2) What aspects of the collaborative process (eg: institutions, character- 
istics of problems and problem solving communities) affect the overall 
structure of the collaboration network? 

In order to address these questions, I need a model of how collaboration net- 
works form, and, in particular, a model of how individual heterogeneity trans- 
lates into network heterogeneity. 

Existing network formation literature can be divided roughly into two cat- 
egories: statistical models and decision-based models. In statistical models. 



see 



et aL 



Menzel and Katz (1955), DeMarzo et aL (20031, Golub and Jackson (20101, Banerjee 



(2011) 



ackson 



(2008) fo r a su mma ry of this literature 



see 

'^See l Caivo-Armengol ( 2004 1 and Calvo-Armengol and Jack son ( 2004 ) 
^see [j ackson and Yariv ~^07l, Jackson and Rogers "2007 1 for examples, plus a survey of 
the literature 



COLLABORATION NETWORK FORMATION WITH HETEROGENEOUS SKILLS 4 

the process of hnk formation is stochastic. Examples of these types of mod- 
els include Erdos-Renyi random network^ Watts-Strogatz small world net- 
works]^ incumbency model^ and preferential attachment models and their 
derivatives]^ These models of network formation have the advantage of being 
mathematically easy to work with. Preferential attachment and incumbency 
models have the additional advantage that they can be made to reproduce the 
observed distribution of degree, with it's extremely long right tail. However, 
because links in these networks are made randomly, they do not allow us to 
answer questions about incentives and network structure. Moreover, they do 
not incorporate any true individual heterogeneity, and thus cannot be used to 
suss out which individual characteristics impact an individual's position in the 
network F°l 

In decision-based models, individuals in the network choose their network 
connections in a way that maximizes an objective function. For example, in the 
Connections Modelj^ players gain a benefit for both direct and indirect links 
(the benefit to indirect links decays across multiple links) and pay a cost for 
each direct link made. In the Coauthor Modelf^ players are must allocate their 
effort across multiple projects where the payoff from a paper is inversely related 
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(2001), firms choose a set of links and an effort level to put into research 
and development, given that such links general both perfect and imperfect 
spillovers. These types of models provide more insight into the link between 
incentives and network structure than purely statistical models. However, 
in all of these models, the individual agents are homogeneous, and thus the 
resulting networks are symmetric. Because individuals in these models are 
homogeneous, they can't tell us anything about the heterogeneity of network 
position we observe empirically. Moreover, the resulting networks don't look 
much like empirical social networks, and thus they can tell us only a limited 
amount about global network structure. 

The model I present in this paper is decision-based, and it is the first such 
model of network formation to incorporate individual heterogeneity. Each in- 
dividual in the model has a set of skills, drawn from a larger poolj^ That set 
of skills represents the human capital of the problem solver-the tools, tech- 
niques and knowledge that are useful inputs to knowledge-based production. 
Any individual may have a subset of those skills, but it is unlikely that a single 
person will have all of them. Each individual in the community is endowed 
with their own problem and the collaboration network forms when individuals 
find collaborators to help them solve those problems. 

Using this model, I show that an individual's position on the network is a 
highly non-linear function of her set of skills. In particular, the number of 
connections an individual has on the network (her degree) is a supermodu- 
lar function of her set of skills. In other words, an individual with a useful 
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example, Roy ( 1951 ) gives an individual a skill level as a "hunter" and a "fisherman". This, 
in essence, is equivalent to assuming that individuals have ability levels in two different 
skills. There has been recent work that explicitly starts to migrate towards a model where 
individuals can utilize both skills for tasks (rather than simply using one or the other). 
Examples include Lazear (2004 2005) and Astebro and Thompson (2011). However, all of 
these works confine their analysis two skills. Here, I consider skill sets of arbitrary size. 



COLLABORATION NETWORK FORMATION WITH HETEROGENEOUS SKILLS 6 

combination of skills will have a disproportionate number of links on the col- 
laboration network. Thus, the relationship between an individual's set of skills, 
and her importance to the collaborative community cannot be captured by a 
linear pricing structure. 

The importance of combinations of skills has a dramatic affect on the overall 
structure of the collaboration network. Using a special case called the Bernoulli 
Skills Model, I show that even when skills are distributed independently, the 
distribution of degree in the network will be highly skewed-a few individuals 
will have a large number of links, while the vast majority of individuals will 
have very few. Small initial differences in individual skill sets create large 
differences in the distribution of links, meaning that the distribution of links 
in the collaboration network may be skewed, even when the distribution of 
skills is not. This skewed degree distribution closely matches the distribution 
that we observe in empirical collaboration networks. 

Moreover, because the model creates a direct link between the skill popu- 
lation and the collaboration network, I can use comparative statics to make 
predictions about how changes in the population of problem solvers will change 
the structure of the network as a whole. Using the Bernoulli Skills Model, I 
show that as problems become more difficult, the degree distribution of the 
network becomes more skewed, and the network becomes dominated by a few, 
high-degree superstars. Using a second case called the Ladder Model I observe 
another pattern-the network becomes increasingly skewed as skills become 
increasingly hierarchical. 

Finally, it is worth nothing that the framework and results I present here 
have broader implications for our understanding of heterogeneity in labor mar- 
kets. I conclude with a brief discussion of these issues. 
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1. A General Model of Skills, Problem Solving, and 
Collaboration Networks 



1.1. Inputs: Problem Solving Population and Problems. Let / = {1, 2, ... 

be the set of problem solvers (individuals). 

Let S = {ai...aM} denote the (finite) set of all skills. 

A individual z's skill set is the subset of those skills she possesses, Ai C S. 
The frequency of skill set A in the population is given by ^^(A), a proba- 
bility measure with support S (\1') C 2"^-that is, \E' {A) is the fraction of the 
individuals in I who have the skill set A C 

Each individual is endowed with a copy of a problem requiring a subset of 
the skills available in the population, u C 

Thus, the inputs to the model are a set of skills used to solve problems (S) 
and a population of problem solvers (\1'). 

1.2. Collaboration and Problem Solving. A collaboration is a subset of 
the problem solvers, C C /. A collaboration can solve a problem if together 
they possess all of the required skills-that is, if the problem is solved if a; C 

The problem yields a payoff of 1 if solved. If a problem solver can solve her 
problem alone (that is, if Ai = u) then she keeps the entire payoff. If she solves 
it with the help of others, she splits the payoff evenly with her collaborators, 
giving each a share of j^,^^ Since others in the community face their own 

"'^^Formally, is a frequency distribution, rather than a probability distribution-that is, 'J is 
a realized distribution of skill sets across the population. The distinction between frequency 
and probability distributions disappears when N is large-using a frequency distribution 
allows me to also make statements about small N as well. 

""^^For simplicity, I assume that all individuals face the same problem. The results are 
identical if each individual faces a different problem drawn from a known distribution of 
problems, fl, with support 2"^, and collaborators are chosen ex ante. If they choose their 
collaborators ex post, then the results are similar, if notationally more complex. 
"'^''This particular distribution scheme has a number of points to recommend it: it does not 
require that agents have control over the way payoffs are split (advantageous when payoffs 



COLLABORATION NETWORK FORMATION WITH HETEROGENEOUS SKILLS 8 

problems, a problem solver may be asked to help with other problems as well. 
Thus, individual i's payoff is the sum the payoff she gets from solving her 
own problem, plus the payoffs she gets from collaborating with others on their 
problems: 



An individual chooses her set of collaborators (Cj) to maximize her utility. 
Note that her payoff to solving her own problem is always positive, and thus 
it is always incentive compatible for her to solve the problem. Since each 
individual controls only her own collaborative decisions, the utility-maximizer 
chooses Ci to minimize the number of collaborators she must work with on 
her own problem-in other words, she chooses a minimal subcover of the set of 
skills she lacks-A^ = uji\Ai. Let Cj denote the set of all minimal subcovers of 
A1. If there exist multiple minimal subcovers (ie: if |Cj| > 1) then I assume 
that the individual chooses C* G Cj at random]^ 

It is worth noting that while I have chosen to model the motivate the de- 
cision to minimize the set of collaborators via an equal split of payoffs, the 
results that follow are unchanged under any distribution scheme which in- 
duces problem solvers to minimize the number of collaborators they use. So, 
for example, if collaborating with additional individuals is costly (eg: in terms 
of time or communication), then the results that follow will still hold with no 



are non-monetary), and it gives each individual her Shapely value for the coalition. Many 
alternative splitting schemes will be behaviorally identical (see below), 
"'^'''it is important to highlight that this payoff is not equivalent to a market wage. A model 
producing equilibrium wages would require considerable additional machinery, including 
time constraints and search costs, and is therefore beyond the scope of the current work. 
"'^'^Since the individual indifferent between minimal subcovers, this choice at random follows 
convention. The results that follow are not sensitive to this assumption. In particular, if an 
individual is biased towards choosing collaborators who are not as busy (eg: those with fewer 
links) then the results that follow hold precisely, rather than on average. If an individual 
is biased towards choosing collaborators who are popular (eg: have many links) than the 
results that follow are qualitatively similar, and if anything are more striking. 
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modification. The results will not hold under any payoff structure where it is 
not individually rational to minimize the set of collaborators used to solve a 
problem. 

1.3. Complementary Skills Networks. For a given a set of collaborations, 
C = {Ci...Ciy}, the collaboration network is represented by an adjacency ma- 
trix, g (C), where Qij (C) = 1 if j G Cj. Note that the network is directed-since 
j G Ci does not necessarily imply i G Cj, it may be that gij (C) 7^ gji (C). 
However, the links are mutual, in the sense that j will never want to terminate 



a link (see Section 1.6.1 for further discussion). When all collaborators are cho- 
sen optimally (that is, when Ci G Cj Vz), I will call the result a complementary 
skills network (or complementarity network for short). 

Definition. A network, g{C), is a complementary skills network (comple- 
mentarity network) if each individual in the network chooses a minimal set of 
collaborators required to solve her problem-eg: if Ci G CjVi. 

Since the set of minimal subcovers each individual (Ci), depends on the 
distribution of skills in the population, I use F (^) to denote the set of com- 
plementary skills networks for a particular distribution of skills, \E'. In other 
words, F (\E') is a set of networks over which all individuals are maximizing 
utility. In all of the measures that follow, I will average over all networks in 

r(^)H 

1.4. Network Measures. Recall that I will be looking at two different ques- 
tions: first, how an individual's position on the network depends on her set 



"'^^Ironically, the great strength of social networks (their rich heterogeneity) also makes them 
difficult to work with. One method for simplifying this problem is to divide networks into 
equivalence classes of networks that are similar in some meaningful way. Here, the networks 
are divided into equivalence classes according to the individual agents' indifference condition. 
That this division is not arbitrary-but rather a nature result of the decision rule-is one of 
the great strengths of this particular model. 



COLLABORATION NETWORK FORMATION WITH HETEROGENEOUS SKILLS 10 

of skills, and second how the overall structure of the network depends on the 
distribution of skills in the population. For both of these questions, I will focus 
on degree: the number of connections an individual has on the network. 

First, I will look at how an individual's average in-degree on the networks 
in r (\E') depends on her skill set. I will denote individual z's in-degree on 
a particular network as di. Her average in-degree will be E[di\, where the 
expectation is taken over all networks in F (\E')-that is, all networks that the 
individual is indifferent between. In Section [2| I will consider the mapping 
between an individual's skill set an her expected in-degree in a particular 
problem-solving population, E [di\ = f {Ai, \E'). 

I will also consider how the overall distribution of in-degree depends on 
the distribution of skills in the population. Let A denote the distribution of 
expected in-degree. That is, A (d) is the fraction of the individuals who have 
expected in-degree d, where the expectation is taken over all g (C) G F 

Before continuing, a brief word about network notation is in order. First, 
note that for ease of reading I will usually drop the argument of g (C). I will 
denote a link from i to j by ij. Using a slight abuse of notation, 1 will use g 
to refer to both the adjacency matrix (as above) and the set of links in the 
network-that is, ij G (7 if i is connected to j in the network g. In a similar 
abuse of notation, I will use g — ij to represent the network that results when 
the link ij is removed from an existing network, g, and g + ij to represent 
the network that results when the link ij is added to the existing network, 
g. Finally, for clarity in the exposition, I will refer to in-degree simply as 
"degree''^ 

^'^Alternatively, we might plot the distribution of degree across all networks 5 G F ('5). That 
is, we could set A (d) — X]ger(*) i^) where Sg (d) is the fraction of nodes in network g 
with degree d. This choice does not affect the results. 

^^This should cause no significant confusion. I use in-degree because it has a clear, empirical 
interpretation, but the results qualitatively similar if we consider an individual's degree in 
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Problem (co): Develop a web application Distribution of skills in the population (W): 



Skills required: a = programming 

b = user interface design 
c = marketing 



© ® © © 
© ® © 
© © © 
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Figure 1.1. An example of a problem and population of prob- 
lem solvers 

1.5. Example. An example will help clarify the structure of this model. Sup- 
pose all of the individuals in the population face the same problem requiring 
three skills: u = {a, b, c} Vz -for example, developing a web application might 
require programming skills, user interface design skills, and marketing skills. 
Suppose the distribution of skills is such that everyone in the population has at 
least one skill, but no one has all of the skills required. In other words, every in- 
dividual in the population has something to add, but none of them can solve the 
problem on their own. Suppose further that each skill combination is equally 
likely-in other words, \E' (A) = ^ for all A G {{a} , {b} , {c} , {ab} , {ac} , {be}} 



and (A) = otherwise (see Figure 1.1). In this particular population of 



problem solvers, each individual needs exactly one collaborator to solve the 
problem-that is, someone with skill set {a} must link to someone with the 
skill set {b,c}, someone with skill set {b,c} may choose from those with skill 
sets {a} , {a,b}, and {a,c}, and so on. The problem solver is indifferent be- 
tween any two individuals who have the skills that she needs. By linking those 
who collaborate on problems, we obtain a collaboration network. There will be 



one such network for every set of optimal collaborations. Figure 1.2 shows two 



collaboration networks for this population of problem solvers and two optimal 



the directed network (the sum of his in-degree and out-degree), or use the degree of the 
individual in a network where directed links are projected into undirected links. 
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Figure 1.2. An example of two networks formed from popula- 
tion/problem in Figure [lT] 

choices of collaborators. The set of all such networks for a given population of 
problem solvers is denoted by F (\E'). 



1.6. Discussion. This model produces outcomes consistent with several em- 
pirical facts about collaboration and problem solving. First, the model predicts 
that collaboration will be more important when the average individual has a 
thinner slice of the total skills required to solve a problem-in other words, 
as problems become more difficult, collaboration networks will become more 
densely connected. This prediction is born out in the data-collaborative work 
has become increasingly common in a variety of academic fieldsj^ More- 
over, the literature supports a connection between this increased reliance on 
collaboration and the difficulty of problems faced, specifically the increasing 
complexity of required methodologiesj^ This highlights one of the advantages 
of using this model of collaboration network formation: statistical models of 
network formation-including preferential attachment-assume constant degree 
over time, and thus cannot be used to make predictions about the relationship 
between problem difficulty and collaborative effort. 
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1.6.1. A Note on Stability and Efficiency of the Complementarity Network. 
Before considering any specific questions about the complementarity network, 



it is worth considering it's stabihty and efficiency. Jackson and Wohnsky 



(1996) introduce an equihbrium concept of network stabihty, caUed pairwise 
stability. Briefly, a network is pairwise stable if no individual would prefer 
to terminate an existing link, and if no pair of individuals would prefer to 



add a link (see Appendix A.l for a more formal definition in the case of a 
directed network). Pairwise stability implies that links are mutual, because 
both individuals involved agree to maintain the link. Theorem [T] states that 
any complementary skills network is pairwise stable, implying that all links 
in the network are mutual]^ Moreover, any complementary skills network is 
strongly efficient-that is, the population extracts the maximum possible value 
from the network 



Theorem 1. Any complementary skills network, g & T (\1/), is pairwise stable 
and strongly efficient. 



Proof. See Appendix A.l □ 



2. Skills and Degree in the Collaboration Network 

In the context of a collaboration network, an individual's degree represents 
the number of collaborators she has, and is a proxy for her importance in the 
collaborative community. In this section, I consider the relationship between 
an individual's skill set, and her degree in the collaboration network. 



^^Because the equilibrium links are mutually beneficial, one could functionally think of 
a complementarity network as either directed (because that is how it is constructed) or 
undirected (because that it is how it functions) . For ease of reading, I will usually omit the 
directional arrows from networks pictured in this paper. 

^^This result-that equilibrium networks are efficient-contrasts with two other models of 



social network formation- Jackson and Wolinsky (1996) and Goyal and Moraga-Gonzalez 



(2001 )-in which pairwise stable networks tend to have more links than is efficient. 



COLLABORATION NETWORK FORMATION WITH HETEROGENEOUS SKILLS 14 

2.1. Skills and Degree: An Example. Before presenting general results, it 
is useful to see an example. Consider the example from the previous section: 
each individual faces a problem requiring three skills, S = {a,b,c}, and they 
each have one or two of those skills, so that ^ has support {{a} , {b} , {c} , {ab} , {ac} , {^c}}. 

An individual's in-degree on the network will depend on the number of 
people who need her skills (the demand for a subset of her skills) and the 
number of other people who can provide those skills (the supply of that subset 
of skills). For example, consider someone with the skill set {a}. Her skills are 
in demand by anyone who needs skill a-in this example, anyone with skill set 
{b, c}. A person with skill set {b, c} may collaborate with anyone who has skill 
a, including those with skill sets {a} , {a, b} , or {a, c}. So in this example, the 
expected degree of an individual with skill set {a} is 

^L^^i«>^J-^({4) + ^((a,6}) + *({a,c}) 

Similarly, an individual with the skill set {a, b} can help anyone who needs 
skill a, anyone who needs skill b, and anyone who needs skills a and b, yielding 
expected degree 



M/({a}) + ^({a,6}) + *({a,c}) * ({6}) + ({a, 6}) + ^ ({6, c}) *({6,c}) 

Note that an individual with skills a and b will have more links, on average, 
than an individual with skill a and an individual with skill b put together: 
E[d{{a,b})] > E[d({a})] + E[d{{b})]. This is because an individual with 
both skills can help anyone who needs skill a, anyone who needs skill b, and 

anyone who needs both. This means that an individual with both of those 
skills will have, on average, more than twice the number of links than a person 
with either of those skills in isolation. 
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2.2. Skills and Degree: General Results. A similar type of result holds 
more generally. Theorem [2] states that an individual's expected degree in a 
complementary skills network is a supermodular function of her set of skills. 
That means that regardless of the skill set required for the problem or the 
distribution of skills in the population, an individual with skill set A U B 
will have at least as many links as individuals with skill sets A and B put 
together. The intuition for this result is similar to the above example-the set 
of all problems that can be solved by someone with the skill set AUB includes 
those that can be solved by someone with skill set A, and those that can be 
solved by someone with skill set B, plus those requiring some skills from both 
sets. 

Theorem 2. For any set of skills, S , and distribution of those skills, , 
an individual's expected degree over the networks in F (\[') is a supermodular 
function of her set of skills. That is, Ed {AU B) + Ed {AnB) > Ed (A) + 
Ed{B). 

Proof. Here, 1 will prove the result for the case where an individual needs only 
one collaborator to solve her problem. For the sake of clarity, also 1 consider 
the case where A (1 B = ^. The proof for the general result is similar, and 
can be found in Appendix A.2[ Since d{Ar\ B) = d(^) = 0, we need to show 



that Ed {AUB)> Ed (A) + Ed {B). Consider d{A\JB). The fraction of the 
population needing C C A U 5 is 5(C) = \E'(S'\C). The fraction who can 
supply the set C is a (C) = Ez)c5\c * (C U D). Thus 

E[d{AuB)]- ^ ¥(cI7d)- ^ ^ 

We can partition C C A U i? into three categories: 

1) C C A 

DC^B 
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2>) C <ZAyjB,C <^A ,C <^B. 
This division gives us the following: 

= E[d{A)] + E[d{B)]+^ 
> E[diA)] + E[diB)] 

□ 

This theorem suggests an immediate corollary. 

Corollary 3. Adding skills to an individual's skill set will never decrease her 
degree in a cost minimizing collaboration network. 

Proof. From Theoremg E[d{AU a)]+E [d {A na)] = E [d {A U a)] > E [d (A)]- 
E [d (a)], and so E[d{AU a)] -E[d (A)] >E[d (a)] > 0. □ 

2.3. Skills and Degree: Discussion. The result presented in Theorem[2]has 
some implications for who we think is important in a collaborative commu- 
nity. In particular, it indicates that it is important to consider an individual's 
combination of skills, rather than looking at each of her skills in isolation. For 



example consider the population summarized in the first panel of Figure 2.1 
The problem requires five skills, S = {a, b, c, d, e}, which are distributed across 
a population of = 5n as indicated. Each skill is held by exactly 2n individu- 
als, and thus no skill is rarer than the others. Traditionally, we might condense 
the information contained in this table into a single-dimensional measure of 
ability-individuals 1 and 2 have the most skills, and therefore, we might expect 
them to have the most links. However, despite having fewer skills, individual 
3 will, on average have more links. This is because while neither of individ- 
ual 3's skills are rare by themselves, in combination, they are both rare, and 



COLLABORATION NETWORK FORMATION WITH HETEROGENEOUS SKILLS 17 



Population 1 Population 2 





a 


b 


c 


d 


e 
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Figure 2.1. Two populations of problem solvers with five skills 
5* = {a, b, c, d, e}-n copies of each individual yields a population 
of size N = 5n. 

useful to a large fraction of the population. This means that agent 3 will 
have more links than a tally of her individual skills might predict — in other 
words, the value of a combination of skills may be greater than the sum of 
its parts]^ This further suggests that in problem solving communities such 
as those in knowledge-based industries, entrepreneurial firms, and academic 
research, models in which individuals are scored on a one-dimensional ability 
scale will fail to capture the full effect of variation between individuals. 

Pushing this point a bit further suggests another implication of Theorem [2]- 
because degree is a supermodular function of a problem solver's complete set of 
skills, it is not generically possible to assign prices to individual skills in a way 
that captures her degree on the social network. This means that examining 
the supply and demand of single skills in isolation does not necessarily capture 
an individual's value to a community of problem solvers. 

Corollary 4. There need exist no vector of prices, /i, such that XlaeA /^a ~ 
d{A). 



Examples of this phenomenon are not difhcuh to find. For example, consider a tech 
entrepreneur who has experience in both computer programming and marketing. Neither 
of those skills are rare individually, but together they are quire rare. 
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To further emphasize this point, consider the skill distribution shown in the 



second panel of in Figure 2.1 This distribution is identical to that in the first 
panel, except that individuals 4 and 5 have been given an extra skill. However, 
gaining these skills does not influence the degree of either. In fact, endowing 
them with those skills does not change any part of the degree distribution. 
Skills a and h have value to individuals 1 and 2, but not to individuals 3 
or 4-clearly, no linear weighting of the individual skills could produce such a 
pattern. Two characteristics of this model contribute to this result. First, skills 
are bundled within a person. Thus, in evaluating a collaborator, that person's 
combination of skills must be considered as a unit. Second, the individuals 
in the model have an incentive to minimize the number of collaborators they 
work with. Together, these two factors mean that an individual's value to the 
collaborative community may be more than the sum of her individual skills. 

These results have implications for empirical models. If we assume that 
skills contribute to outcomes independently, then we will underestimate the 
value of particular combinations of skills and overestimate the value of other 
combinations. Moreover, this non-linear relationship between skills and degree 
means that if we observe skills imperfectly, the amount of variation in outcomes 
that is explained by an observed set of skills will decline dramatically in the 
set of skills we can observe. Additionally, the fact that an individual's degree 
depends on both the distribution of skills in the population and the set of skills 
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she already has suggests that in collaborative fields, optimal training decisions 
are highly individualizedp] 



3. The Distribution of Skills and the Structure of the 
Collaboration Network: The Bernoulli Skills Model 

The degree distribution of a collaboration network affects the function of the 
collaborative community. A skewed degree distribution indicates that a small 
number of individuals are involved in most of the collaborations, while most 
people are involved in very few. In this section, I look at how the distribution 
of skills in the population affects the degree distribution of the collaboration 
network. 

3.1. The Bernoulli Skills Model. Thus far, I have considered results for a 
general skill population. However, when we turn to the effects of skill pop- 
ulations on network structure, it is difficult to obtain clear predictions using 
such a general construction. Therefore, in this section I will consider a special 
case, where skills are distributed independently with equal probability-that is, 
Prob {ai G A|aj G A) = Prob {ai & A) = p\f i ^ j E S. I call this special case 
the Bernoulli Skills Model because each individual's skill set can be thought 
of as the result of a set of M Bernoulli trials, each with probability p of suc- 
cess. This means that the distribution of skill set sizes in the population is 
binomial, implying that the fraction problem solvers who have a particular set 
of k skills is (A) = p^ (1 — and the fraction having any k skills is 

^'^To see this, consider the Shapley value of an additional skill. When an each individ- 
ual requires only one collaborator to solve a problem, the individual's degree is , (the 
results are similar for the case where multiple collaborators may be required). Using 
degree, d{Ai) ~ SccA; ^ — ^~^'^~^¥(cud) ^ ^ value function (note that d{.) satisfies 
both d(0) = and superadditivity) , the Shapely value of a skill, a, to individual i: 
^a.^ (d) = EscAMa} (^ccB Eocsucua!'^«CU'')uP) ) ' "^^'^^ depends on both the 

existing skill population and the individual's current set of skills. 
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('^) p'^ (1 — pY^^^ ■ This special case has several characteristics which make it 
a useful starting point. First, because skills are uncorrelated and occur with 
equal frequency, individuals with the same number of skills will, in expecta- 
tion, have the same degree. This means that the only variable of interest is 
skill set size, k. Second, because this model has only 2 parameters-M and p-it 
is easy to see how changes in the distribution of skills in the population affect 
the structure of the collaboration network. 

3.2. Degree Distribution of the Bernoulli Skills Model. Let A denote 
the distribution of expected degree. That is, A (rf) is the fraction of the pop- 
ulation who have expected degree rf, where the expectation is taken over all 
g G r(\E'). In this particular case, I will use a convenient shorthand: Aa^j, 
represents the distribution of expected degree when M skills are independently 
distributed with probability p. 

Theorem |5] states the closed form expression for the degree of an individual 
in the Bernoulli Skills Model. 

Theorem 5. Suppose each individual in a population faces a problem, u {S), 
requiring M skills. If the skills are distributed independently with Proh (a) = 
pVa G S, then the expected degree of an individual with k skills is 

E [d{k)] = p 

Proof. Since S (^) = 2"^, every individual needs to make only one link. Thus, 
we can write E [d (A)] = J2c'ca ' where 6 (C) is the fraction of the popula- 
tion who need skill set C and a (C) is the fraction of the population who can 
provide skill set C. Since the skills are independent, we can separate this sum 
according to the size of the skill set required. If we start with the individuals 
lacking exactly one skill in Ai and end with the individuals needing all of the 



/l - p + p^\'' 

V f J 
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skills, we obtain the following sum: 



E[d{A)] = J2 



k 

E 

i=l 



P 



o • • • 



p^ 



k\ P^'-'{1 -pf 



P 



M 



1 — p + p^ 
p2 



2 \ 



- 1 



□ 

Note that when skills are independent, an individual's degree on the network 
depends only on the size of her skill set, k. Therefore, in the Bernoulli Skills 
Model, it is appropriate to interpret the size of an individual's skill set as her 
"ability"-something that we cannot do in the more general case (recall Figure 



2.1 in the previous section). Moreover, in the Bernoulli Skills Model, degree is 
a superadditive function of the number of skills a person has: E[d{k + 1)] > 
E [d (k)] + E [d{l)]. This suggests a corollary to Theorem [s] 



Corollary 6. Suppose each individual in a population faces a problem, u {S), 
requiring M skills. If the skills are distributed independently with Prob (a) = 
pVa G S , then degree is a superadditive function of the size of her skill set, 
k = \M\. Moreover, her degree is strictly increasing in k. 

However, we still cannot price the skills individually in such a way that we 
capture degree, despite the fact that skills are independently distributed. 

Theorem 7. Suppose each individual in a population faces a problem, u {S), 
requiring M skills . If the skills are distributed independently with Prob (a) = 
pW a G S, then there exists no vector of prices, ^, such that '^aeA^^a■ ~ 
E [d (A)] for all ACS. 
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Proof. Any such vector would be required to set = d (a) = — p) for 

all a e S. But that would imply that d (A) = kp^^'^{l - p) for |y4| = k. This 
is clearly not true for k > 1. □ 



This superadditive relationship between skills and degree has a dramatic 
affect on the distribution of links in the population. Although the distribution 
of "ability" (skill set size) in the Bernoulli Skills Model is binomial, and thus 
symmetric, the distribution of degree is highly skewed. Individuals with a few 
extra skills will have many additional combinations of skills, so individuals 
with marginally more skills will have dramatically more links. As a result, 
the distribution of degree has much higher variance than the distribution of 



ability. Figure |3.1| shows an example of ability and degree distributions with 
M = 10 and p = |. 

The resulting network has a very distinctive "hub and spoke" structure, in 
which a small number of individuals (the hubs) participate in a majority of 
the collaborations, while most (the spokes) participate in relatively few (see 



Figure 3.2 for an example network). Empirical collaboration networks tend to 



have a similarly skewed degree distribution. Figure 3.3 shows an example-a 
coauthorship network of network scientists (data courtesy of Newman (2006)). 
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Figure 3.2. A Bernoulli Skills Network with 1000 problem 
solvers, M = 16 and p = ^■ 




Figure 3.3. A coauthorship network of network scientists 
(Newman ( 2006[ )) and its degree distribution. 



Simlar patterns can be observed in a wide variety of other contextsrj It is 



28t 



see 



(1996 


1, 


Iyer et al. 


(20061) 



jazz musicians (Gleiser and Danon (2003)), and coauthorship networks in a variety of fields: 
Newman] p001[) (ph ysics), |Moody| ( |2004[ ) (sociology), [Goyal et al] ( |2006[ ) ( economics , 



and Acedo et al. ( 2006 1 (management science) . 



COLLABORATION NETWORK FORMATION WITH HETEROGENEOUS SKILLS 24 

worth noting that this is the first decision-based model of network formation 
to rephcate this empirical feature of social networks. Note, in particular, that 
while preferential attachment (a statistical model of network formation) will 
create a skewed degree distribution, the only distinguishing feature of high- 
degree nodes is their age-older nodes have a better chance of benefiting from 
the growth of the network. 

This exaggeration in the degree distribution has an empirical implication- 
individuals who have a disproportionate number of links in a collaboration 
network may not have a disproportionate number of skills. In particular, indi- 
viduals who are only slightly more skillful may be dramatically more important 
in the collaborative community]^ This means that collaborative communities 
may have "superstars'-individuals with a disproportionate influence in the 
community-even when skills are distributed relatively evenly in the popula- 
tion. Moreover, modeling individuals as having skill sets may account for some 
unexplained variation in collaborative outcomes. 

3.3. Problem Difficulty and the Distribution of Degree: Comparative 
Statics on the Bernoulli Skills Model. The hub-and-spoke structure of 
the collaboration network has implications for other behaviors that take place 
over collaborative ties-the more concentrated links are in the hands of a few 
hubs, the more influence they will have over opinion formation, reputation, 
and the decisions over new technology than they would in a network with a 
more equitable distribution of links. In this section, I use comparative statics 
of the Bernoulli skills model to examine how changes in the distribution of 
skills change the amount of inequality in the distribution of links. 



Thus, while great innovators hke Thomas Edison are undoubtedly more skilled than the 
average entrepreneur, they are likely not 1000 times as skilled, as their levels of productive 
output might suggest. 
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In the following, I will use the Gini coefficient as my measure of distributional 



equality (See Appendix A. 3 for a discussion of the gini coefficient in this case). 
Using the fact that AM,p{dik)) = {^^) p'' (l - p)^'''" for k € {0, 1, 2, ...M}, 
the gini coefficient of the degree distribution Am,p is 



G(p,M) = l- 













(1 -p + 


[C-r)"-i] 


-M(l-p) 



As is conventional, values of G {p, M) closer to 1 indicate a more unequal distri- 
bution of degree, while values closer to indicate a more equitable distribution 
of degree. 

Recall that the Bernoulli Skills model has only two parameters: the number 
of skills required to solve the problem (M) and the fraction of those skills 



possessed by the average individual (p). Figure 3.4 shows the Gini coefficient 
for various values of M and Figure |3.6| shows it for various values of p. It 
can be shown that |^ > 0, meaning that as problems require more skills, the 
distribution of degree becomes increasingly skewed towards a few high-degree 
nodes. Similarly, as p — )■ 0, the Gini coefficient decreases meaning that as 
the probability of having a given skill goes down, the network becomes more 
skewed (see Figure 3.6)Qrhese results are summarized in Theorem [s} 



Theorem 8. Suppose each individual in a population faces a problem, u (S), 
requiring M skills and further suppose the skills are distributed independently 
with Prob {a) = p"^ a E S . Let G (p, M) be the Gini Coefficient for the resulting 
degree distribution, Am,p- Then 



"'''Note that The behavior for p near 1 is slightly more complicated. For high values of p, 
many individuals in the network do not need to collaborate to solve their problem, and the 
network starts to break apart, resulting in an uptick in the gini coefficient for p near 1. The 
lower bound on this range is p = .88 . In this paper, I am concerned with cases where 
collaboration is necessary, so the behavior of the gini coefficient in this upper range of p 
values is not particularly pertinent. More detailed information can be found in the proof of 
Theorem [Sj 
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M = 4 M = 8 M = 16 





Figure 3.4. Examples of Bernoulli Skills Networks with = 1000 



p = .3 



p = .5 



p = .7 
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Figure 3.5. Examples of Bernoulli Skills Networks with = 1000 



(1) G {p, M) is strictly increasing in M. That is, the distribution of links 
in the collaboration network is more uneven when the problem being 
solved requires more skills. 

(2) As p ^ 0, G {p, M) is strictly decreasing in p. That is, the distribution 
of links in the collaboration network is more uneven as the probability 
of having each skill gets lower. 



Proof. These relationships are illustrated in Figure 3^ See Appendix |A.4 for 
details. □ 
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Together, these results can be interpreted in terms of the difficulty of the 
problem being faced. A problem is more difficult if it requires many skills 
(M 'I"), or if the average individual has only a few of them {p \-). This means 
that the above can be recast as a comparative static on problem difficulty. As 
problems become increasingly difficult (M and p J,), the degree distribution 
of the network becomes more skewed, and a small number of "superstars" will 
dominate the collaborative community. 

G(p,M) 



0.9 - 



0.8 - 



0.7 - 




M=12 
M=10 
M=8 

M=6 
M=4 



0.2 0.4 0.6 0.8 1.0 



Figure 3.6. The gini coefficient, G{p,M), for different values 
of M and p. 



4. Skill Ladders: Skill Hierarchy and Degree 

In the previous section, I considered a special case in which skills are entirely 
uncorrelated. In this section, I consider the effect of correlations between 
skills. I divide the skills into multiple skill ladders, where the skills within a 
ladder build on one another. I show that when skills are correlated in this 
way, the degree distribution of the collaboration network becomes even more 
unequal than when skills are uncorrelated. This suggests that when skills in a 
community are structured in this way, a very small number of individuals will 
tend to dominate the collaboration network. 
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4.1. The Ladder Model. In order to consider the effect of correlations be- 
tween skills, in this section, I will need a second special case, which I will 
call the Ladder Model. I will define a ladder to be an ordered set of skills, 
L = {ai,a2,a3...ai} C S, such that Pro6 (have aj|have flj+i) = 1 (that is, such 
that any individual who has the i^^ skill in the set must have all of the skills 
that precede it in the set)|^ Here, I consider a special case where the skills 
in 5* are partitioned into m ladders of equal length The set of ladders is 



denoted S = {Li...Lm}- Figure 4.1 shows an example with 12 skills arranged 
into four ladders. 

bs a C3 • da I 



33 



32 



b2 < I C2 



3i • bi • ci * di 

Li L2 La L4 



Figure 4.1. 12 skills arranged into four ladders of equal length. 



I will call an individual who has all of the skills in a single ladder an "expert" 
in that ladder, and 1 will call the set of ladders that individual i is an expert 
in Ai C S. 

One additional assumption will allow me to compare the results in this 
section to the results of the Bernoulli skills model. I will assume that the 
conditional probability of having the next skill in a ladder is the same for all 
skills-that is, Pro6 (have ai\ have aj_i) = p for all ajP^ The probability of 



Page ( 2007 1 introduces this concept of skill ladders. 



Obviously, since the length of a ladder is an integer, there will only be equal-length ladders 
if m divides M evenly. To simplify the exposition, I have written the following as if this 
is true. However, all of the the following results hold if the ladders are equal length up to 
integer constraints, which allows for cases where m does not divide M evenly, 
''"'in other words, I assume that putting a skill at the end of a ladder doesn't change the 
essential difficulty of obtaining that skill. If one instead makes a skill at the top easier to 
obtain (eg: because it builds on previous experience) or harder to obtain (eg: because they 
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being an expert in a ladder of / skills is then pK Thanks to this assumption, 
the case where m = M corresponds to the Bernoulli skills model. On the other 
extreme, m = 1, and all of the skills are arranged in a single ladder. Before 
considering ladders of arbitrary length, I will first look at this case, where 
m = 1. 



4.2. Example: a single ladder of skills. Suppose the skills in the set S 
comprise a single ladder of length M. Because Pro6 (have aj|have Oj+i) = 1 
Vi G S, an individual's skill set can be represented by the number of skills she 
has {\Ai\ = k implies Ai = {ai,a2---ak}) ■ Because individuals can be ranked 
in order of the number of skills they have, this case corresponds to a model in 
which each individual has a one-dimensional ability measure. 

The linking behavior in this case is very simple. The only individuals who 
have skill are those who also have skills ai...aM-i- Anyone who doesn't have 
all M skills links to someone who does. The resulting collaboration network is 
a set of isolated stars, each with links, on average. Figure 



4.2 



compares 



the network structure in the case with one skill ladder (m = 1) to the network 
structure in the Bernoulli skills model, where skills are independent (m = M). 
The populations that make up these two networks are similar-they have the 
same number of problem solvers, the same number of skills, and the same 
probability of having an additional skill. This means that the probability of 
an individual having all of the skills required to solve the problem is the same 
in both networks. Moreover, in both, exactly one individual has all of the skills 
required. However, the two networks have a much different structure. When 
all of the skills build on each other, the network is centered around a single. 



are more demanding than the skihs that came before), then the resuhs remain quahtatively 
unchanged. 
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m = 1 ladder m = M ladders 



Figure 4.2. Ladder Skills Networks with iV = 27 and M = 3. 

high-degree node. When skills are independent, the network structure is much 
more distributed^ 

4.3. Results for m ladders. Now consider the more general result. Suppose 
the skills are arranged in m equal-length ladders. As in the previous example, 
only experts obtain links. Thus, a model with m ladders reduces to a Bernoulli 
skills model with m independent skills. Theorem [9] presents a closed-form 
expression for a individual problem solver's degree in the case with m skill 
ladders. 



■^^Note that this could be interpreted as a prediction about organizational structures in 
different industries. In industries where value is created through the exploitation of exist- 
ing knowledge (eg: manufacturing), skills tend to build on one another, and it is viable 
to rank workers by ability. However, as demand for workers shifts towards industries that 
create value by generating new knowledge, we expect skills to be arranged in ladders less 
often. This model predicts a corresponding shift from hierarchical organizational structures 
towards more distributed organizational structures. Empirical evidence seems to support 
that prediction. Traditional organizational structures were hierarchical (the theoretical un- 
derpinnings are explored in |Rosen| ( jl982| )). However, evidence indicates that organizational 
structure within firms is changing-hierarchical structures are flattening, and workplaces are 
becoming more decentralized (see, for example, Bresnahan et al. (2002) and Rajan and Wulf 

pooel)). 
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Theorem 9. If is a distribution of skills such that S = {Li...Lm} is a 
partition of S into m equal-length ladders with Prob ( have al\ have a' 



(i-i) 

p, then an individual with the skill set A will have expected degree E [d{Ay\ 

k 



P I 2M 

\ p m 

is an expert in. 



, where k is the number of disciplines the individual 



Proof. The ladders are of equal length, so the length of a single ladder is 
and the probability that an individual is an expert in any one ladder is p^ . An 
individual receives a link only if she is an expert in a field. Define a new set of 
skills that correspond to the set of ladders: S = {Li...Lm}- The individual's 
new skill set is Ai, where Lk G Ai if she is an expert in ladder Lk. Each of 
these new skills has a probability equal to the probability of being an expert 
in that field, so define p = p*^/™. The probability of being an expert in a 
particular ladder is independent of the probability of being an expert in any 
other ladder, so this problem reduces to one with m independent skills, with 
probability p^ . The result then is a simple extension of Theorem [sj □ 

We can now do a comparative static on the number of skill ladders, to 
see how the amount of hierachy in the skill pool affects the structure of the 
collaboration network. Theorem [lO] indicates that in communities where skills 
are more hierachical, the degree distribution of the collaboration network is 
more skewed, and a few experts become extremely high-degree hubs in the 



network. Figure 4.3 shows how the gini coefficient depends on the number 



of ladders for the case where M = 6 (note that the bottom curve (m = 6) is 



equivalent to the Bernoulli Skills Model with M = 6 in Figure 3.6). 



Theorem 10. Suppose S skills are arranged in m ladders of equal length, with 
constant conditional probability Prob {have al\ have a^_i) = p and Prob [have a{) 
p Wj = l...m. The gini coefficient of the resulting network is decreasing in the 
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Figure 4.3. The gini coefficient, G{p,M), for different values 
of m and p. 

number of ladders, m. That is, when there are fewer skill ladders, the degree 
distribution becomes increasingly uneven. 

Proof. A model with m skill ladders is equivalent to a Bernoulli Skills model 
with m skills and p = p^^/™. The gini coefficient is thus G (p, Since this 
function is increasing in the second argument (see Theorem [sj, it is decreasing 
in m. □ 



One of the main contributions of the model presented above is it's extremely 
general treatment of individual heterogeneity. Although it is possible to model 
the heterogeneity of problem solvers more simply-for example, via a one- 
dimensional ability levelQ by giving each individual a type or speciality^ 
or some combination of the twcp^it is not clear that those models capture 
what distinguishes one problem solver from another. The model presented in 
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5. DiSGUSSION: THE IMPORTANCE OF HETEROGENEITY 
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this paper encompasses both abihty-based and type-based models]^ while also 
allowing for other, more complex types of heterogeneity. This finer-grained ap- 
proach has significant impacts on outcomes. In this section, I examine those 
impacts by directly comparing a model in which individuals have "types" to 
one in which individuals have overlapping sets of skills. 

Suppose there are two different problem solving populations, faced with the 
same problem, oj = {a,b,c}. In population A, every individual has exactly 
one skill, which we can think of as her speciality or "type": \1/ (a) = ^, \I' (6) = 
|,\I'(c) = |. In population B, the skills are distributed independently. This 
means that and the probability of having skill set A is \l/ (A) = HieyiPi- Let 
Pi = Prob {have skill i) = ^ for i = a,b, c. 

These two populations have much in common: in both, the three skills oc- 
cur in equal proportion, and both populations average one skill per person. 



However, as illustrated in Figure 5.1, the collaboration network is much dif- 
ferent in the two different populations. This pair of examples highlights the 
value of a more detailed treatment of skills in modeling problem solving. The 
assumption that every individual has a type or specialty is not necessarily 
benign-it impacts our predictions about the value of certain individuals in the 
community, as well as impacting the structure of the collaboration network. 

6. Discussion of Implications, Extensions, and Conclusion 

In this paper, I have presented a model of collaboration network forma- 
tion in which individuals have heterogeneous skill sets and collaborate to solve 
problems that none of them could solve as individuals. The result is a collabo- 
ration network, the structure of which depends on the distribution of skills in 
the underlying community. I have showed that when individuals have complex 



■^^As mentioned earlier, a ladder model with m = 1 ladder is equivalent to an ability measure. 
A type-based model is equivalent to a population where each individual a single skill (as in 
the below example). 
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Population A Population B 

(one skill per person) (skills independent) 




expected degree expected degree 

Figure 5.1. A population with types and a population with 
independent skills. 

skill sets, their combination of skills is important, meaning that their degree 
in the network cannot be captured via a linear pricing system. Using a special 
case of the model in which skills are distributed independently, I show that 
small initial differences in skill sets are magnified in the network into large 
differences in degree. This means that the network can have a skewed degree 
distribution, even when skills are distributed symmetrically. This corresponds 
with an empirical fact: the degree distribution of collaboration networks is 
highly skewed, despite relatively low variation in the perceived distribution 
of ability. Finally, I show that as problems become more difficult, the distri- 
bution of degree in the network becomes increasingly skewed towards a few, 
high-degree "super stars". 

The more general treatment of individual heterogeneity that I present in this 
paper has some interesting implications for labor markets knowledge-based 
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fields, where workers are richly heterogeneous and collaborate to solve prob- 
lemsj^ In particular, given that there is a relationship between the number of 
collaborators an individual has and her output]^ the model presented in this 
paper can provide insight into individual heterogeneity and output. The re- 
sults of Section [5] strongly indicate that the decisions we make when modeling 
heterogeneity matter. A full exploration of the effects of complex heterogene- 
ity on labor markets is outside the scope of this work, but I will provide a brief 
discussion here. 

Empirically, output in creative and knowledge-based fields is highly con- 
centrated among a small number of people]^ This long-tailed distribution of 
productive output has implications for the distribution of wages and welfare, 
and thus there has been considerable interest in understanding why such a 



concentration in labor demand occurs. Rosen (1981)'s model of production 
can induce a long-tailed distribution when there is a high premium on quality, 
and production technology decouples effort from output quantity (eg: in cre- 
ative industries, where a single performance or album can be enjoyed by many 
consumers). However, such technologies are not relevant in knowledge-based 
industries, where effort is not decoupled from output volume. The long-tailed 
distribution of collaborative interactions in this paper suggests an alternative 
explanation for the observed variation in output. Moreover, the prediction that 



"^^If we were to recast the relationship between collaborators as a relationship between 
employees and firms, then an extension of this model would also provide insights into the 
affect of heterogeneity on labor in a non-collaborative context. 

^'^In the Bernoulli Skills model, where each individual has a complement, the relationship is 
one-to-one. 

Goyal et al.l ('2006') show that a 



4] 



Newman 



(2001), Moody (20041, 



Acedo et al. 



(2006), and 



small number of academics produce the majority of papers written, Rosen^ (,1981) ob serves 
a similar pattern in in music, film, and textbook writing, and Uzzi and Spiro (20051 notes 
a similarly skewed distribution of output among directors, producers, and other creative 
artists on Broadway. 
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output will become more skewed as problems become more difficult suggests 
an empirical test in the context of aggregate labor demand. 

The non-linear relationship between skill sets and output also has impli- 
cations for the explanation of variation in labor outcomes when individual 
heterogeneity is imperfectly observed. If we assume that skills make a linear 
contribution to output, then we would expect a linear decline in the amount 
of variation explained. However, if the relationship between individual skills 
and output is non-linear, as is suggested by the model in this paper, then 
the amount of variation in output explained by observables will drop nonlin- 
early as well. This suggests that decreases in the amount of variation in labor 
outcomes explained by observed agent heterogeneity could be the result of a 
shift from manufacturing-where skill could reasonably be considered to be a 
one-dimensional variable-to knowledge-based industries, where individual het- 
erogeneity is more complex. A more thorough exploration of the implications 
of skill-based labor heterogeneity on labor markets would make an interesting 
extension. 

Turning to future work, the framework that 1 have introduced in this paper is 
extremely flexible and has the potential to provide insights into a wide variety 
of questions that could not previously be answered. Moreover, the results of 
this paper are evidence that more complex models of individual heterogeneity 
are both tractable and can provide the means for addressing a wide range of 
new and important questions surrounding problem solving and innovation. 
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Appendix A. 

A.l. Pairwise Stability and Efficiency. Briefly, a network is pairwise sta- 
ble if no individual would prefer to terminate an existing link, and if no pair of 
individuals would prefer to add a link. Although this definition is usually used 
in undirected networks, it works equally well in the current context. Formally, 
a directed collaboration network, g, is pairwise stable if 

(1) for all ij e g, Ui (g) >Ui{g~ ij) and uj (g) > Uj {g - ij) 

(2) for all ij ^ g, if Uj {g + ij) > Uj (g) then Ui {g + ij) < Ui (g) 

Together, these two conditions ensure that links are mutual. That is, if a 
network is pairwise stable, then both ends agree to maintain the link. 

Theorem. Any complementarity network, g G r(^), is pairwise stable. In 
other words, Wij G g Ui (g) > Ui{g — ij) and Uj (g) > Uj {g — ij) and for all 
U ^ 9, Uj {g + ij) > Uj (g) then Ui {g + ij) < Ui (g). 

Proof. First, consider whether any individual wishes to unilaterally remove a 
link, ij e g. Removing this incoming link costs individual j her share of the 
payoff from solving i's problem ^ \c]+i — ^^^^ '^^^ never choose 

to do so. Individual i chose a minimal set of collaborators that allowed her to 
solve her problem, so removing this outgoing link means that she can no longer 
solve the problem. This would cost her the payoff (^jcq+T — from solving 
the problem, and thus she will also never choose to do so. Finally, note that 
no individual would ever want to add an outgoing link to a cost-minimizing 
collaboration network, because having chosen a minimal set of collaborators, 
any additional link would require her to further split her prize. □ 

Theorem. Any cost minimizing collaboration network, g G r(^'), is strongly 
efficient. In other words, '^^Ui (g) > '^^Ui {g') \/ g' & G. 
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Proof. Because all value is generated from solving problems, the maximum 
possible value in the network is A^. Since solving problems is incentive com- 
patible and there is no loss, the problem solvers always extract the maximum 
value from the network. □ 



A. 2. General Proof of Theorem [2l An individual with the set A U B 
will be able to help anyone needing any subset of those skills. Let 6 (C) be 
the demand for a particular set of skills, C. In the general case, 6 (C) = 
\1' {S\C) + J2d-^{cud)=o ^ ("^V (C* U D)). The fraction who can supply the set 
C is a (C) = Y.Dcs\c ^ (C U D). Note that 6 (C) and a (C) depend only on 
the particulars of the problem (S), the distribution of skills (\E'), and the sub- 
set of skills (C). Thus, any individual with the skill set A U 5 has expected 
degree 

CCAUB ^ ' 

We can divide the problems that an individual with AVJ B can solve into 
three groups: 

(1) Requires only skills from set A: C A 

(2) Requires only skills from set B, including at least one found only in B 
: {C\C CBand3beCstbe B\A} 

(3) Requires at least one skill from each set that can only be found in that 
set: {C \ C C All B, where 3a,beC s.t. a G A\B and b G B\A} 

Using this partition, we can write 



CCA ^ ^ CCBandCnBf!=lll ^ ' CCAuB andCnA, CnB^0 ^ ' 

= E[d{A)]+ ^ + '^ 
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which imphes that 

HC) 



E[d{AUB)]+ E[d{AnB)] = E[d{A)]+ 



CCBandCnBy^d 



a{C) 



+ (t) + E[d{A^B))] 



5{C) 



= E[d{A)]+E[d{B)]+(j) 
> E[d{A)]+E[d{B)] 

A. 3. Discussion of the Gini Coefficient in the discrete case. The Gini 

coefficient measures the area between the Lorenz curve of a distribution (in 
this case, the distribution of expected degree), and the hue of equahty. In 
the case of a discrete distribution with values yo...yN where i/i < yi+i, the 
Lorenz curve is a piecewise function connecting points {Ei,Di) where Fi = 
^^^p A is the fraction of individuals with strictly less than yi links, and 
Di = ^fT" ^^/"'l^'" is the fraction of the total number of links they hold. See 



Figure A.l for an example. The gini coefficient for a discrete distribution is 
given by G = 1 — J2iLi i^i ~ -^j-i)- Lower values of the gini coefficient 
indicate a more equal distribution of links across individals in the population, 
and higher values indicate a more skewed distribution of links. The coefficient 
is which is when the distribution is perfectly equal (ie: the bottom x% of 
the population holds exactly x% of the links) and 1 when all of the links are 
held by a single individual. 



A.4. Proof of Theorem |8[ Recall that since Aa/,p (rf (A;)) = (f) / (1 - p) 
for k G {0, 1, 2, ...M}, the gini coefficient of the degree distribution Am,p is 



M-k 



G(p,M) = l- 







" / , o\ M 




pM 


(1 -p + p2) 


(^) -1 


- M(l -p) 
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line of equality / 









% of players with < d links 



Figure A.l. An example of the Gini coefficient for a discrete 
distribution, A [y). In this case, the random variable y takes on 
one of five values, yo...yi. The Gini coefficient is the area of the 
shaded region between the line of equality and the Lorenz curve. 



G(p,M)o 




Figure A. 2. Plot of the Gini Coefficient for values of M and p. 
Figure A. 2 shows the Gini Coefficent of Am,p for values of M and p. 



First, show that G (p, M + 1) - G (p, M) > 0. 

G{p,M + 1)-G {p, M) = (i-p)- [{{i-P+P^r^'-P^^^-^) (i-P-P^O 



_(M+l)(l-p)-p2+(i=£±£^) ' (l-p+p2) ■ (l-p+p2)2_p2((2_A./)(l_p)+p2 

Thus, G{p,M+ 1)-G {p, M) > if ((1 - J9 + _ p2M+2j (1 _ p _ pM^ + 



p 



2A/+1 



(M + l)(l-p) > 0. Since (l - p) > (l-p-p*^), this is true if 
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((1 - p + _ p2M+2^ (1 _ p _ ^ p2M+l (^M + l){l-p- p^) > 0, 

which is true if (1 - p + p2)M+i_^2M+2^p2M+i (M + 1) > 0. Since {1 - p + p'^) > 
p for p e [0, 1], the previous is true if - p2M+2 _^ p2M+i _|_ i) > g, 

which is true if 1 - p^+^ + (M + 1) > 0. Since 1 - + (M + 1) = 
1 + p^ (1 - p) + Mp^ > 0, the result holds. 

Now, show that G(p, M) is strictly decreasing in p as p — >■ 0. 

Note that G (p, M) is strictly decreasing in p for all p < T (p, M) where 
T(p, M) is a boundary condition given by 

(/I — -I- 2\ ^ 
-1 + 2p - 2p2 + p'^ + ( j ((2 - p) (1 - p) p"^ - (1 - p)' (1 + p)) - M (1 - p)' 

Through numerical analysis, T {p, M) < .88 for all (p, M). Thus, G{p,M) is 
decreasing in p for p < .88, and the result follows 



